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Abstract 

In this paper we investigate the achievable rate of a system that includes a nomadic transmitter with several 
antennas, which is received by multiple agents, exhibiting independent channel gains and additive circular-symmetric 
complex Gaussian noise. In the nomadic regime, we assume that the agents do not have any decoding ability. These 
agents process their channel observations and forward them to the final destination through lossless links with a fixed 
capacity. We propose new achievable rates based on elementary compression and also on a Wyner-Ziv (CEO-like) 
processing, for both fast fading and block fading channels, as well as for general discrete channels. The simpler 
two agents scheme is solved, up to an implicit equation with a single variable. Limiting the nomadic transmitter 
to a circular-symmetric complex Gaussian signalling, new upper bounds are derived for both fast and block fading, 
based on the vector version of the entropy power inequality. These bounds are then compared to the achievable 
rates in several extreme scenarios. The asymptotic setting with numbers of agents and transmitter's antennas taken 
to infinity is analyzed. In addition, the upper bounds are analytically shown to be tight in several examples, while 
numerical calculations reveal a rather small gap in a finite 2x2 setting. The advantage of the Wyner-Ziv approach 
over elementary compression is shown where only the former can achieve the full diversity-multiplexing tradeoff. We 
also consider the non-nomadic setting, with agents that can decode. Here we give an achievable rate, over fast fading 
channel, which combines broadcast with dirty paper coding and the decentralized reception, which was introduced 
for the nomadic setting. 

Index Terms 

MIMO, Decentralized detection, wireless networks, Wyner-Ziv, CEO, compress-and-forward 

I. Introduction 

In this paper we deal with a network in which a nomadic transmitter has several antennas and is communicating 
to a remote destination, where no direct link exists between the transmitter and the final destination, as is depicted 
in figure Q] The final destination receives all of its inputs from several separated agents, which are connected to 
it through fixed lossless links with a given capacity. This setting is identical to the setting of [1], only that here 
we focus on fading channels. Namely, the channel between the transmitting antennas and the agents is a Rayleigh 
fading channel with independent channel gains, where the extension to other fading statistics is straight forward. In 
this contribution we consider both fast fading and block fading channels. The channel fading coefficients, or channel 
state information (CSI) are known in full to the agents and the final destination, but not to the transmitter. This 
setting is closely related to the setting of the multiple input multiple output (MIMO) channel, which is thoroughly 
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treated in the literature, see [2]. The multiplexing gain is a common asymptotic measure of performance of MIMO 
systems. It assesses the capacity increase, for high signal to noise ratios, due to the use of multiple antennas [3] in 
the scheme. In this paper, we analyze the multiplexing gain for the suggested network, where recent examples for 
the multiplexing gains of multi terminal networks are [4] and [5]. The results reported here have implications on 
other MIMO-related channels, such as the MIMO broadcast channel [6], the MIMO relay channel [7], and ad-hoc 
networks [8]. All these works deal with situations where multiple antennas are transmitting and the signals are 
received in a distributed fashion, either by relays, destinations or any combination of the above. In addition, results 
regarding ad-hoc networks [9], relay channels [10] and joint cell-site processing [11] are closely related, providing 
yet another aspect of the achievable rates in wireless networks, where relays form, in a distributed manner, the 
required spacial dimensions. 

Our model assumes that the transmitter is nomadic, which means that the agents do not possess the codebook 
in use, and thus do not have any decoding ability [1]. A good way to model a nomadic setting is by letting the 
transmitter use random encoding. Such model excludes any decoding from the agents. Given that the codebook is 
random, we further assume that it is Gaussian. In this case, as the model becomes close to source coding and the 
Gaussian CEO (Chief Executive Officer) [12], we were able to obtain analytic expressions for an achievable rate 
and for upper bounds. Relevant works here are distributed source coding by Wyner and Ziv (WZ) [13], [14] who 
deals with the multiple terminals WZ problem and the Gaussian CEO by [15], among others. 

The achievable rates derived in this paper extend the achievable rates from [1] to the case of fading channels, and 
multiple antennas at the transmitter and at the receiver. The techniques that are used for the derivation are based 
on the well known CEO or WZ distributed source coding. These techniques, although intended for source coding 
problems, enable better utilization of system resources also for channel coding problems, as done for example by 
[16],[17] and [18]. 

The upper bounds in this paper were derived using the vector version of the entropy power inequality, which 
was used for several known problems which are based on Gaussian statistics. These include the MIMO broadcast 
channel [6] and the Gaussian CEO with quadratic distortion [15]. Several generalizations to the original entropy 
power inequality exist, among them are [19], [20] and [21]. 

The Gaussian signaling used by the transmitter results with the channel outputs being Gaussian and for the 
nomadic setting, also memoryless. Notice that unlike traditional source coding problems that use the CEO or WZ 
techniques, and examine the resulting distortions, we focus on the allowed communication rates. Thus any upper 
bounds or even optimality shown for a source coding problem, although strongly connected, is not identical to our 
problem. Therefore the technique used to show optimality of the distributed WZ with two terminals problem ([14]) 
does not carry over to our setting. 

This paper is organized as follows, in section HI] the setting is described and the basic definitions and notations 
are given. Section [Hi] describes the elementary compression approach and gives several results about the achievable 
rates when using this approach. Section [IV] improves upon the approach taken in section [Til] by including CEO 
compression (as in the CEO problem) at the agents and the final destination. An upper bound to the achievable 
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rate, when using nomadic transmitter and non-decoding agents, is given in section [V] and then demonstrated by a 
numerical example, to be rather close to the achievable rate when using the CEO compression. In the last section, 
an achievable rate for when the agents are informed of the codes used by the transmitter, and the transmitter is 
informed of both agents' processing and channel coefficients is given in section [VIll Concluding remarks are then 
made in section [Villi 

II. Setting and Model definition 

Throughout this paper, boldfaced letters are used to denote vectors X of length n, calligraphic letters T to 
denote sets, capital letters X are usually used for random variables, and lower case letters for realizations of random 
variables x, indices i,j,k, and counters n,r,t. Subscript denotes an element within a vector and superscript X r 
denotes the set X\, . . . , X r . 

The statistical mean is denoted by E, * denotes the transpose conjugate and £A/"(S, S) stands for complex 
Gaussian random variable with mean EE and covariance E. 

An example for the model appears in Figure Q] The model consists of a transmitter S which has t transmitting 
antennas and which transmits during n channel uses. In each channel use, the transmitter sends a vector X G C[* xl l 
to the channel, where ~ X^fe=i E[X(A;)*X(fc)] < P. The transmitter uses circular-symmetric complex Gaussian 
signalling, which is known to be optimal for various problems involving the Gaussian channel. The communication 
rate is denoted by R. The message to be sent M is encoded by a random encoding function X = 4>s,f(M) such 
that for all messages M, the outputs of the encoding function are randomly and independently chosen according 
to probability Px{x).We indicate the random encoding function by a random variable F. That is, 

^ F :[l,...,2 nR ]^X n . (1) 

The agents are not informed about the selected encoding F, but are fully aware of Px- 
We have r agents A\, . . . , A r , each receiving the scalar channel outputs 

Yi(k) = Hi{k)X{k) + Ni(k), i = 1, . . . ,r, k = 1, . . . ,n (2) 

where Hi(k) £ Cl lx *l is the vector of the channel transfer coefficients, which are either ergodic (fast fading) or 
static, non-ergodic (block fading). In both cases, the coefficients are distributed independently from each other, and 
from any other variable, according to circular-symmetric complex Gaussian distribution CW(0, 1). Similarly, the 
noises are distributed as Ni(k) ~ CAf(0, 1), and are independent of each other and along time. For the sake of 
brevity, we drop the time index k from now on. 

Most of the results which are reported here can be easily extended by including other fading distributions, such 
as Ricean, invoking the results of [22]. 

The r agents are connected to a remote destination D with lossless links, each with capacity Cj bits per channel 
use. The final destination D decodes the message M from the r messages, which are sent from the r agents, where 
decoding function is 4>d,f '■ [1,..., 2^ nCi ] — > [1, . . . , 2 nR \. This setting is depicted in figure [T] 
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Fig. 1. A system that includes a transmitter with t = 2 and two agents A\ and A 2 (r = 2), connected to the final destination with capacities 
of Ci and C'2, respectively. The channel fading coefficients H are designated by {if; j}. 



For fast fading channels, the rate R is said to be achievable, if for every e > 0, there exists n sufficiently large 
such that 

Pr(M ^ m\M = m) < e, 



(3) 



m— 1 



where Pr(M ^ m\M — m) includes averaging over the channel and the random coding. In parallel, the rate-vs- 
outage probability of e, for block fading is said to be achievable if there exists n sufficiently large such that 



2»» 

^ ^ Pr(M ^ m|M = m) < e. 



(4) 



m— 1 



The transmitter is nomadic [1], that is the codebook that is used /, is unknown to the agents, but is fully known 
to the final destination. This way the agents treat input signals not accounting for the coded transmission, in a CEO 
or multiple WZ approach. All the reported results in this paper assume that the transmitter is limited to using only 
a circular-symmetric complex Gaussian codebook. Notice that the Gaussian codebook is not necessarily optimal, 
(a counter example exists for the non fading case, where using binary signaling at the transmitter with a simple 
two level demapper at the agents can outperform the Gaussian signaling scheme, see [1]). However, the Gaussian 
codebook does provide a good candidate, as for d — > 00 and C — > the Gaussian codebook is indeed optimal. 

In addition to the nomadism, the transmitter has no information regarding H — {H(k)}% =1 , where 

" tfi(fc) 
H(k) = : 

H r (k) 

while the final destination is fully informed about H. By default, each agent has the full CSI H. However, many 
of the presented schemes require each agent to know only its own channel coefficients H i7 as is stated in the text. 

Although the transmitter is unaware of the channel realizations H, it does have the full knowledge of the channel 
statistics, as well as {Ci}, which is used to calculate the rate in which the transmitter will encode its messages. 
Alternatively, higher layer control layers can indicate the code-rate which is to be used, based on an ACK/NACK 
mechanism. 

As said, the multiplexing gain of any scheme describes the scaling laws of its capacity, as P is increased [3]. 
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Definition: The multiplexing gain of a scheme is defined as 

R 

m = lim - — ; — -, (5) 
P^oc log(P) ' 

whereas the diversity is defined by 

d= lim _ lo g( Pr {outage}) 

P^oo l0g(P) 

We will use = and < to denote equality and respectively inequality, under the operation limp^oo j^^pj- The 
norm \V\ 2 for a vector V is defined as \V\ 2 — J^i l^| 2 - 

III. ELEMENTARY COMPRESSION SCHEME 

In this section, a scheme that incorporates elementary compression at the agents is analyzed. By elementary 
compression, we mean compression process that does not use the correlations between and thus does not 

require the agents to have full CSI, rather, they just need Hi. In addition, the implementation of such compression 
and especially the decompression are rather simple and realized with low complexity algorithms at the agents and 
the final destination. 

A. Ergodic Channel 

We first propose an achievable rate for general ergodic channels. 

Proposition 1: An achievable rate for an ergodic channel, with elementary compression is 

R EC = I(U r ;X\H), (7) 
(EC stands for elementary-compression) with the constraints: 

I{Ui-,Yi\H) <C h i=l,...,r, (8) 

where 

r 

\X,H{Ui\x, fyPu.^.HiUiWii h). (9) 

i=l 

The proof involves the random generation of codewords, TJ \ according to ni-=i Pui\ii{Ui(k)\H (k)), as done in 
standard rate-distortion problems with non-casual side information (H). These codebooks are made available to 
both all encoders and decoder. The proof appears in Appendix [Hi 
Applying Proposition Q] to the Gaussian MIMO channel, one gets 

Proposition 2: An achievable rate for ergodic setting using elementary compression is equal to: 



Rec = max E# 

when the maximization in ( liOD is such that 



where 



log 2 detf/ r +diagfl-2" 9l(H)N ) HQH 



(10) 



log 2 ((2<^> - 1) (HiQHt + 1) + ljj < d, i = 1, . . . ,r, (11) 
V = {Q: Qu - for j, Q M > 0, trace(Q) < P}. (12) 
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Here each agent employs the elementary compression scheme which is based on an underlying additive circular- 
symmetric complex Gaussian noise channel Ui = Yi + Di, where Di is the compression noise. As in the Gaussian 
CEO problem, there is a difference between the used formulation and the backward channel Yi = Ui + Di, used 
for standard rate-distortion compression. 

Another issue here is that the known fading affects the variance of the compression noise. The quantization 
noise is circularly-symmetric complex Gaussian with variance Pu^k) that depends on H(k). Let us further define 
qi(h) — I(Yi\ Ui\X, H = h), which, due to the Gaussian model, is equivalent to: 

1 



= 1-2-*. 
1 + Pd, 

, r, qi is a function of H = {Hi} r i=1 and thus is a random variable. 



(13) 



Notice that for all i = 1, 

It is easy to verify that the optimization problem in Proposition [2] includes a concave objective function ( fTOb but 
a non-convex domain ( fTTT l. Notice that since HQH* is distributed the same as HVQV* H* for any unitary V, we 
can still limit the search for optimal Q to non-ordered elements of a diagonal Q (which is the set V). 

Remark 1: Despite the name elementary compression, it requires an infinite number of codebooks at the agents 
and the final destination, since they should correspond to infinitely many fading coefficients. 

1) An Achievable Rate when r, t — > oo: Let us consider the case where r = rt, symmetric agents with constant 
total capacity from the agents to the final destination (Cj = Ct/r, i — 1, . . . ,r).Such scheme can account for 
bottleneck effects in the channel between the agents and the final destination. Let us take r — > oo, and find the 
limiting rate which is reliably supported by the scheme (f = Sisfc*} ) pj rst consider that when t — > oo, we have 
^ Hl * — > 1, almost surely. Applying this to (Tl3T > and (fTTl i. and also setting Pd to be identical for all the agents 
(the maximal Pd, unlike what was done in d25l l. which set Pd to be the minimal), we get 



lim Rec < t lim r^H 

r — >oo r — >oc 



log 2 1 



PX/t 



1 + Pi 



D„ 



t lim rE„ 



log 2 1 



ttPv 



1 



P+i 



(14) 



where A is one of the unordered eigenvalues {A^} and v = 



A 



mm{t,r} 

We can exchange the order of the expectation and the limit due to dominant convergence 



is a random variable with some finite mean. 



lim Rec < fE v 



lim rlog 2 1 



ttPv 



+- p+1 

^ 2 c t /-_i 



Ci 



ttPv(1 + P) 
(1 + P) 2 



c t 



P 



1 + P' 



(15) 



1, the inequality in equation (Q3) is in fact an equality. 



where Ev = max{r, -}, Since also argmax 1<i<; 
Thus we have the following corollary: 

Corollary 1: In the limit ofr,t—* oo, an achievable rate using elementary compression is Ctj^-p- 
Discussion: This result can be explained by noticing that the MIMO channel capacity is approximately linear with 
r when P is fixed, which leaves the fixed Ct to limit the performance, where we can not get to Ct because of the 
nomadic setting. In addition, this rate reaches Ct in the limit of large P, as expected. Notice that the rate (Tl~5T > does 
not depend on the ratio between the number of receive and transmit antennas, r. This is because the signal to noise 
ratio (SNR) at the final destination, from every antenna, is very small (Ct/r = log(l + ^4=p) — > Ct/r = ^jp). So 
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that the total SNR at the final destination is — Ct _u£^ F \ r , and the achievable rate can be calculated as (small 

py. 

log 2 \I + P'/tHH*\^rP'. (16) 
Notice that ( fTBT l indeed does not depend on r. Taking P' = Ct +(i+ P y in dTBT l results with ( fl~5l >. 

B. Block fading channel 

For block fading channel, the Shannon capacity is zero, and the concept of rate-vs-outage is the leading figure 
of merit. 

1) Rate vs Outage: 

Proposition 3: The rate-vs-outage region for the block fading channel, is calculated using the same equations 
( 1701 ) and ([77}, used for the fast fading channel only without the expectation over H. This results with an achievable 
outage probability for rate R, calculated as 

Pv{outage) = min Pr (r > log 2 det (l r + diag ^ ^^. j H QH*^j ) . (17) 
The underlying MIMO channel enables us to analyze the proposed schemes using the diversity multiplexing tradeoff. 

2) Diversity Multiplexing Tradeoff (DMT): An analysis for the diversity-multiplexing tradeoff is given next. The 
diversity and multiplexing are defined in the end of section [TT] Since our links are lossless, any outage event in the 
system is due to the underlying block fading channel. Thus, we fix all these links to carry information in the rate 

C i = jlog 2 (P) + e,i=l,...,r (18) 

where m < min{r, i} is the multiplexing gain which is used by the system, and e > is some fixed positive 
constant. 

Proposition 4: The DMT d{m) of any scheme with C{ as in M8\) and non-ergodic block fading underlying 
channel, is upper bounded by the minimum between the piecewise linear function of {k,{r — k)(t — k)), for 
k = 0, . . . , min{i, r} and 

«(!-=), 09) 

where m stands for the multiplexing gain. 

For example, the maximum diversity achieved here is with m = 0, which results with d(0) = t, which is smaller 
than rt. This result can be understood by considering that when m = 0, the capacity of the links between the 
agents and the final destination are very small. So that getting good channel between the transmitter and only one 
agent will not suffice to forward the information. So we need a good channel at every agent, which results with 
diversity order of t and not rt. 

An implication of the result is with respect to the MIMO broadcast channel. In order to achieve the full 
multiplexing gain in a MIMO broadcast channel, the transmitter is required to have full CSI [6]. Here, an elementary 
compression scheme, with limited cooperation between the destinations achieves the full multiplexing gain without 
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channel state knowledge at the transmitter (which usually requires some feedback). Further, such cooperation is 
usually easier to obtain when the destinations are co-located. 

Proof: The proof is based on the cut-set bound [26]. For any covariance constraint E[J1*] = Q, and channel 
H, any achievable rate is upper bounded by the cut-set bound, for any cut S C {1, . . . , r} 

R c = I(X; Y S \H) +J2 C 3= lo S 2 det(/, S | + H S QH* S ) + (r - \S\) [- log 2 (P) + e] . (20) 

So that for any scheme that achieves the rate R(H) for channel H, with input covariance Q, the probability of 
outage is limited by 

V R* > : Pr{R{H) < R*} > Pr{R c (H) < R*}. (21) 
Now we can calculate the upper bound on the DMT: 

log(Pr(outage)) 



d(m) < 



Urn 

P^oo 



log(P) 

,. min Qg p logPr (log 2 det (l {sl + H S QH* S ) < f\S\ log(P) - \S c \e) 
mm — lim ■ ; — ■ -. (22) 

SC{l,...,r} P^oo lOg(P) 



Using [3], for each S we get that the diversity ds(m) is the piecewise linear function connecting points (k, (\S\ — 
k)(t — k)), with as the argument. Next, we need to minimize this ds{m) over all subsets S. Since Pr{0 < 
— re} = 0, we can limit the search space to subsets that include at least one element. Define s = |«S|, so that we 
can use 

Pr{outage with s} > s p- ds{m) = p- d ^ m ) . (23) 
Let us use the underlying functions of ds(m), before applying the piecewise linear operation 

min (s [t . (24) 

i< s <r \ r J \ r J 

The minimum of d24l) is obtained by either taking s = 1 or s = r, regardless of m. Since the piecewise linear 
function exhibits the same behavior, we get Proposition |4] ■ 

Corollary 2: The elementary compression achieves the full multiplexing gain, but fails to achieve the DMT. 
Proof for Corollary^ 

1) Next we show that elementary compression suffices to achieve the full multiplexing gain m = min{r, t}. 
The first step is to lower bound (TlOt by a specific choice of Q and ■ We can lower bound ( fTOt by taking 
Q = y/ t and the following suboptimal quantization noise power Pr> i = ^IjJ'i^ 1 an d by further taking 

Pd — Pd v i = argmax{P Dj } 

P 1 



R 



EC 



> E if logo det I, 




PK 
t 



, (25) 
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where {A^} are the eigenvalues of HH*. Now since for i = 1, . . . , m we have that \ > 0, 



lim ■ 

P^oo 



log 2 1+1 



PA, 
t 



we get 



log 2 (P) 



,. Rec 
lim 7— r = m. 



m 
r 



P^oo log 2 (P) 



(26) 



(27) 



2) As for the DMT achieved by elementary compression ((Iec), upper bound the outage probability from equation 
( fTTI i. and calculate the resulting diversity 



d<Ec( m ) < — li m 

P^oo 



logPr (log 2 det (j P + Pdiag {^f^}^ PP*) < mlog(P) 

log(P) 

logPr (log 2 det (J r + t (Pv2 £ - l) P^) < mlog(P)) 
= "^00 lo^P) ' (28) 

where the inequality in d28l ) is since pj^^w^ui < pytfgT 5 smce ^°&2 det (7 + HQH*) < log 2 det (I + 
PHH*) for any diagonal Q with trace(<5) < P [3]. The matrix P^ is defined as 



Hz = 



Next, since det(P™P) = P m we have the equality 



\H r \ 



Pr (log 2 det (J r + t (P^¥ - l) HzH*2) < mlog(P)) = 

Pr (log 2 det (P"^J r + t (2 e - P-t 1 ) H^Hl) < 0) . (29) 

Taking the limit with respect to P, one gets 

^lim Pr (log 2 det (P~^J r + t (2 e - P~^) P/P^) < 0) = Pr (log 2 det (P/P^) < -(t2 e ) r ) . (30) 

Using Hadamard's inequality, as long as r > 1, the limit of the probability in (l30l is strictly larger than zero, 
so that when taking the logarithm and dividing by log(P), one gets that 



d E c{m) = 0, 



(31) 



for all m > 0. So the optimal DMT is not achievable using elementary compression, for more than a single 
agent r > 1, and multiplexing gain of more than zero. ■ 
3) This sub-optimal DMT is since there exist correlations between the received signals at the different agents 



\H,H*\ 



> 0, we get that |P/P^| < 1. As these correlations decrease, for example, by taking t to be very 



large compared with r, the outage probability becomes smaller. In the next section, we will exploit these 
correlations by a CEO approach, to reach the optimal DMT. 



February 2, 2008 



DRAFT 



9 



IV. CEO BASED SCHEME 

In this section we consider the same setting as in the previous section, but use the technique from [23], that 
is compression followed by bining, for better utilization of the capacity of the links, exploiting the correlations 
between the received signals at the agents. 

A. Ergodic Channel 

Proposition 5: An achievable rate, for the ergodic channel, when using CEO compression is: 

Rceo = max J min V [C t ~ I(Y f , U t \X, H)\ + I(U S ;X\H) I I , (32) 

Pu l \Y % , H {u t \y t ,h) ^SC{l,...,r} \£7a J J 

where 

r 

Px,Y r ,H,u>-(x,y r ,h,u r ) = Px(x)PH(h)Y\_PY i \H,x(yi\h,x)P Ui \H,Y i {ui\h,y i ). (33) 

i=l 

Proof guidelines: The proof involves the random generation of TJ i according to rifc=i Rui\H( u i(k)\h(k)), and then 
randomly partitioning the resulting code book into 2 nCi bins, as done in a WZ or a CEO based quantization. Then, 
each agent selects Ui which is jointly typical with the received (Yi, H). It proceeds by sending the corresponding 
bin index to the final destination through the lossless link. The final destination knows H and the bins in which 
U r fall in. Finally, the destination looks for (X, U r ) which is jointly typical, and from deciding on the transmitted 
X, declares the decoded message. 

The formal proof is by degenerating Proposition [15] such that W r are constants, and the random encoding, which 
is represented by / is known to all parties. ■ 

Focusing on the Gaussian channel, for the fast fading channel using ( l32l the following proposition is derived. 

Proposition 6: An achievable rate when using CEO compression over Gaussian channel with fast fading is: 

J2 [Q - di{H)] + log 2 det (l| S | + ^diag (l - 2-«W) . H S H* S 

(34) 

where H s = {Hi} ieS - 

The Proposition is proved by using the underlying channel Pu\y.h f° r me compression, such that the quantization 
noise is independent of the signal, as done for the elementary compression scheme in section [Til] Similarly, define 
Prji as the power of the circular-symmetric complex Gaussian quantization noise and qi(H) is the corresponding 
parameter, calculated as ([T3~l l. 

The rate in (|34T > is calculated assuming signalling with Q = ^rh- The proof that such signaling indeed maximizes 
the achievable rate is relegated to Appendix Hill This means that the achievable rate from Proposition [6] applies also 
to the sum-rate of multi access channel, see [22]. Notice that although introducing correlation in Q improves the 
compression, since it uses the correlation to save bandwidth, it comes on the expense of the achievable rate, due 
to the reduced degrees of freedom. Thus, the total rate is still maximized by taking Q = ^-h- 

Remark 2: The optimization over qi in the above problem is a concave problem, and thus can be efficiently 
solved. The optimization results with an achievable rate, while assuming full knowledge of CSI (H) in the final 



Rceo 











max 


,J 


min < 




{ 9i :C[ r><t ]^R + 




5C{l,...,r} 
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destination and in all the agents. However, this requirement does not impose severe limitations. This becomes 
evident in the sequel where Correlations [5] |7] describe special cases, where qi (H) is fixed, so only Hi is required 
at the agent. 

Notice that (f34T > includes joint optimization over all possible channel realizations. A simpler non-optimal approach 
is to optimize separably for every channel 



Rceo.2 = max E ff 



max min \ V [d - Qi ] + log 2 det + diag (l - 2~«) S H S QH$\ \ . 

{0< gi }t =1 5C{l,...,r} [ £g c V' J j 

(35) 



Unlike many channel coding problems over fast fading channels [24], [25], where there is no loss in optimality 
when using different codebook for every channel realization, here there is a strict gain to using a single codebook, 
such that the decoding is done jointly over the different realizations of H. So that Rceo,i < Rceo, with high 
probability. 

Remark 3: As in [1], Eq^ in both (l34l i and (l35l l, can be interpreted as the rate wasted on the compression of 
the additive noise by the i th agent's processing. So that, for example taking S = {</>} in ( f34t . results with the 
achievable rate of X)i=i ~ ^HQi}- Of course, this represents only one of 2I 5 elements within the minimum. 

Remark 4: When the agents do not have H, but rather only Hi, that is, each agent has only its own channel to 
the transmitter, and not the channels of the other agents, the optimization in Proposition [6] is done in this case over 
qi : Cl lxt l -> M+. 

Next, we give a solution to the optimization problem issued by Proposition [6] for the symmetric case with r = 2. 
Such setting results with 

Proposition 7: An achievable rate for the symmetric setting with r = 2 and ergodic setting is equal to 



R C EO = 2(C~E H [ qi (H,e)}), 



where (\a\ + — max{a,0}J 



- log 2 ( 



1+0 £|ff,| 2 



> f h(a+\hI\s) 



lo. 



82 



■ A+|g 3 -ir 



(36) 



(37) 



F H {0)) eKFai-zfap), 



with 



F H (9) 
A 



2(1 



1 + 20- 



'(1 + 20) 2 - 40(l + i 



(A+ ± + |g 1 |2 + | g2 |2 )A 
(A + | J ff 1 p)(A+| J ff 2 p) 



- det (HH*) 
t x ' 



and > which is set such that 



P 



E H log 2 det h + -diag ( 1 - 2~ 9 < {H 



The proof is relegated to Appendix [IV] 



HH* =2(C-E H [ qi (H,6)]). 



(38) 

(39) 
(40) 

(41) 
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Fig. 2. The resulting compression parameters q\ and 52, as function of 9, the Lagrangian for some specific H, and also average results when 
averaging over 1000 channels H, t = 2 and P = 7 dB. 



An intuition into the solution offered by Proposition [7] is by considering both q\ and q 2 . For that, assume 
| 2 > \H 2 \ 2 , then Ffj 1 (a+j^) < (s+f^p) and 

.1+f l^i I' 





P, ,9 9i = 

-ffil <0: < (44) 

92 = 0. 



t 



This reveals the structure of the optimal solution, which can be described as a variant of the famous "water-filling". 
This is since as in classic water-filling, depending on the available bandwidth, the parameter 9 determines how 
the compression depends on the channel realizations. When C is very large, 9 is very small, and fewer channel 
realizations result with q\ = q 2 = d44l i. When q% = q 2 = the scheme does not relay any information regarding 
the channel outputs, thus saving bandwidth for better channel realizations. 

This is demonstrated in Figure [2] for P = 7 dB, and 2x2 system, where the averaged maximum and minimum 
of qi and q 2 , over 1000 channels is depicted, as function of the Lagrangian 9. It is observed that the average 
difference between the two compression parameters qi and q 2 is about 0.4 bits/channel use. Figure |2] also draws q\ 
and q 2 for some specific channel H. It is seen that q 2 is always larger than qi, since |-ffi| 2 < \H 2 \ 2 , for the specific 
channel. Form 9 = 2 on, q\ = 0, which means that no information is sent from agent Ai to the final destination 
for this channel realization, when 9 > 2. 
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1) An Achievable Rate when r, t — > oo: For the case where r/t = r, = Cj/r and r — > oo, we repeat the 
suboptimal assignment and again fix qi = q* = eCt jr. 
Next, we define q t = rq* = eCt- Now we can write for any S: 

\s\vt 



log 2 



= J2 log 2 (l+P/<(l-2« t /'-)A l ) T 5 rElog 2 (l + P(l - 2^/ r )TT S v s ) 

(45) 



Where = ^ s J, vt , v$ — \ J\\jt anc ' V denotes min. 

Now we can exchange the order of the expectation and the limit due to dominant convergence: 

151 



r 5 E 



lim rlog 2 (l + P(l - W^ttsVs)] = rrjg t PE[i/ 5 ] - Pft^, (46) 



since E[^] = max|-^, -r^y j- On the other hand, for that same S: 



Ypi-<b\ = ^riCt-qt). (47) 

Next, we set q t , such that the right hand sides of ( |46l > and ( |47] i are equal. This results with the achievable rate of 
Rceo = Ct — qt — Ctp^i ■ Notice that this rate is identical to the elementary compression (JT3J. One would expect 
that the Wyner-Ziv approach will improve as t is increased, because then the correlations between the received 
signals is increased, improving the compression rates. However, from ( fT~5b . it is observed that for small powers, the 
mutual information is independent of t, so that also the correlations between the received signals Yi are independent 
of r. In addition, from the discussion below Correlation Q] it is evident that the equivalent signal to noise ratio 
when received at the final destination is very low, so that the inter-agent correlations are also low, diminishing the 
effect of the CEO compression. 

B. Block Fading Channel 

As in elementary compression, here we again use the rate-vs-outage figure of merit, and then also give the DMT 
for the CEO based scheme. 

1) Rate vs Outage: For the non-ergodic block fading channel, equation d34l i, stands for the averaged mutual 
information. Since the rate-vs-outage is not concave with respect to Q, as in the fast fading channel, Q — yj is 
no longer optimal [2], and we need to optimize also over Q. 

Proposition 8: An achievable rate R is correctly received over a block fading channel, with an outage probability 
of at most e, as long as the following holds (obtained from ( 1541 ) ): 



Pr max 

\Q£V,{0<qi<CiY i=1 I sc 



min ^ |log 2 det (/| S | + diag (l - 2~«) . g5 H S QH* S ) + £ [Q - q t ] j j < p) < e 

(48) 

where the probability is with respect to H. 
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2) Diversity Multiplexing Tradeoff (DMT): The CEO approach can get to the upper bound of the DMT, and thus 
gives the optimal DMT. 

Proposition 9: The full Diversity Multiplexing Tradeoff dim) is the minimum between the piecewise linear 
function of (k, (r — k)(t — k), for k = 0, . . . , min{£, r} and 



ill- 

v r 

where < m < min{r, t}. This tradeoff can not be achieved using the elementary compression, only using the 
CEO approach. 

This Proposition is proved by showing that the upper bound on the DMT from Proposition 0] is achievable. 
Proof: Consider again C, = ^ log(P) + e and then fix % = 0.5e in equation d48b . Let us write the diversity 
here as dcEO, where CEO stands for chief executive officer 

log(Pr(outage)) 

dcEoym) = - lim : — — = 

p^oc log(P) 

mm QeV Pr(log 2 dct(f sl + (l- 2-° ^)H S QH*) < f\S\\og(P) - 0.5\S c \e) 
mm — lim ■ ; — - -. (50) 

SC{l,..,r} P— oo log(-P) 

The difference between the upper bound in equation ( l22l and (l50b is with the attenuation of (1 — 2 5e ). Since this 
attenuation diminishes as P gets large, it is evident that we get the same diversity as the upper bound. 

Next, we show the achievability of the full multiplexing gain, thus proving the DMT. We get the following 
achievable rate: 

R C EO = rh log 2 (P) + o(log 2 (P) ) , (51) 
where fh — min{r, t} and limp^oo °^°^p^ = 0. This is since 

^ min ^ {\S\™ log 2 (P) + min{r - \S\,m} log 2 (P) + o(log 2 (P))} = mlog 2 (P) + o(log 2 (P)) (52) 
is fulfilled with S = <fi and S — {1, . . . , r}. ■ 

C. An Achievable Rate For the Case of Multiple Antennas Also At the Agents 

The case of multiple antennas at the agents is different than the above case, where only a single antenna was 
used by the agents, in that now the agents can use more elaborated processing in order to improve the overall 
performance. We consider here only ergodic channel, where the block fading case follows the same line. 

The channel can still be described by (O, only that now, Yj(fc) is a vector, taking values from C[ riXl l, Ni(k) ~ 
CN(0,I ri ), and H t (k) £ & nxt \ again with elements that are independently and identically distributed, according 
to the circular-symmetric complex Gaussian distribution with variance of 1. 

The difference between this scheme and the previous one, is that now each agent can add non-white quantization 
noise (but still input independent) to the received vector, where such dependency can improve the resulting achievable 
rate, by improving the estimation at the final destination, through better utilization of the lossless links. 
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Proposition 10: An achievable rate, over an ergodic channel, with several receiving antennas at each agent, is 



Rceo = max min 

{A i (H):O rxt 'i^B i }': =1 SC{l,...,r} 



E J - lo S2 I'm, + Ar 1 !] + log 2 

I i£S c 



7 E ies m t + ydiag ((I„ H + A*)^) H s Hg 



where 



H, 



( ) 

V i / 



, (53) 



(54) 



ies 



and 



Mi = {M : M e C m xm , m! < min{r 4 , t}, M t 0}. (55) 
To achieve this rate, each agent performs singular value decomposition of Hi = ViTiUi, so that Vi € C^ iXri l and 
Ui £ C[' X< 1 are unitary matrices, for calculating v*Yi. Then each agent looks for U™ which is jointly typical with 
(v*)Yi, when Ui and ViYi are distributed as 

U i =v*Y i +D i . (56) 

Here Di is random vector, independent with Yi, distributed as J\fC(0, Aj). Define rrii = rank(Fj) and redefine the 
matrix I\ to include only the non-zero elements in IV The matrix Aj € C^ miXmi ^ represents random variables, 
like in the previous section, only here it is a vector instead of a scalar. 

Note that Q = K-It is optimal in d53l as in ( |34l . By assigning = 1, Aj = Pd 4 and noticing that TiUi = Hi, 
we see that indeed d53l coincides with (|34] >. as expected. 

V. Upper Bounds 

In this section several upper bounds are derived, for both fast fading and block fading cases. 



A. Cut-Set Upper Bound 

The simple cut-set upper bound [26], although quite intuitive often provides good results. This bound is very 
general, and is not limited to the nomadic setting. 

Corollary 3: Cut-set: Any achievable rate in the system is upper bounded by the cut-set bound, 



I(X:Y S \H)+ C * 
For the ergodic fast fading channel, this upper bound equals 



R < min 

SC{l,...,r} 



iES c 



R < min 

SC{l,...,r} 



E H log 2 det (J|s| + jH s Hl) + £ Q 
Where for the block fading channel, the rate vs outage is limited by 



Pr(outaqe) = min Pr \ R > min 

Qev \ SCjl,..,r} 



log 2 det (7| S | + H S QH%) + J2 C i 



%es c 



(57) 



(58) 



(59) 



The proof is based on [26], considering also the proof of Proposition |U and is omitted due to its simplicity. 
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B. Upper Bounds for Nomadic Transmitter 

The upper bounds here are calculated assuming nomadic transmitter, who uses circular-symmetric complex 
Gaussian codebook. Thus they show what cannot be achieved, no matter what processing is used at the agents, as 
long as they are ignorant of the codebook used. In the following, we first upper bound general channels, and then 
apply the bound for ergodic channel and the block fading channel. 

Proposition 11: The achievable rate for reliable communication is upper bounded by: 



!>' : min J - ft] + -I(X; V s c\H) + -\. (60) 

Proof: We first give an information theoretic upper bound for the achievable rate, based on [1]. We define Vi 
to be the message sent from agent Ai after receiving n channel outputs. Notice that H is fully known to all agents 
and to the final destination, so they can use it to calculate the {Vi}. 

For any subset S C {1, . . . , r}, the following chain of inequalities holds: 

y2Ci>-I(Y r ;Vs\V s o,H) (61) 



n 

ies 



-I(Y r : V r \H) - -I{Y r : V s c \H) (62) 

n n 

-I{Y r , X; V r \H) - -I(Y r , X; V s c \H) (63) 

n n 

-I(X; V r \H) - -I(X; V s c \H) + -I(Y r ; V r \X, H) - -I(Y s c ; V s c \X, H) (64) 

n n n n 

-I(X-V r \H)--I(X-V s c\H) + Y^ qi - V ft (65) 



i=l ie s c 



-I(X; V r \H) - -I(X; V s c \H) + V ft. 
n n i — ' 



(66) 



where j63l is because Vi is a function of Y \ and H, so we have the Markov chain Vi — {Yi, H} — X and ft is 
defined by ft = ^I(Yi] Vi\X, H). By changing order we get 

-I(X-V r \H) < y\d - ft] + -I(X-V s c\H). 



n 

ies 



Next we utilize Fano's inequality 



(67) 

(68) 
(69) 
(70) 
(71) 
(72) 



The following lemma, which is proved in the Appendix, is required for obtaining computable upper bounds (single 
letter upper bound). 



R < 


—H(M) = -I(M; V r , F\H) + 
n n 


-H(M\F,V r ,H) 
n 


< 


-I(M;V r ,F\H) + P e 

n 




< 


-I(M, F; V r \H) + P e 
n 




< 


-I(X(M,F);V r \H) + P e 

n 




< 


y\C l -q. l ] + -I{X-V s c\H)- 

ies 


+ Pe- 
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Lemma 1: If the transmitter is nomadic, so the agents have no decoding ability, and the transmitter uses Gaussian 
codebooks, the following inequality holds for any SC{l,...,r}: 

-I(X; V S \H = h)< mlog 2 ( J[ |j, 5 , + A 5 (fc)| - - J[ \W s {k)\™ ) (73) 
where A 5 (fc) = H s (k)QH* s {k), 



\k=l k=l 



W s (k) — I V hes (74) 

diag (2-«0O) , £S H s (k)QH s (k)* \S\ < t 

qi (h) = ^I{Y^;Vi\X,H = h) and m = min{i, |5|}. 

Since HQH* is distributed the same as HU*YXJH*, when J7 is a unitary matrix and £ is diagonal, Q can be 
restricted to be diagonal in (l73l . However, unlike the achievable rate, which is a concave function of Q, so that 
Q oc I is optimal, the right hand side of (l73l is not concave in Q, thus in the sequel, we let Q be such that Q E V. 
Notice that the inequality in d73l is tight when the channel is H — (1, ... , 1) T , which corresponds to the Gaussian 
CEO problem with quadratic distortion [15]. 

1 ) Upper Bound for Fast Fading Channel: We begin the derivation of an upper bound for the fast fading channel 
by evaluating the bound of Lemma Q] for the fast fading: 

Corollary 4: In the limit of n — > oo, due to the ergodic fading process: 

lim -I(X;V S \H = h) < F{S,q s ) (75) 

n — >oc Tl 

where 

F(S,q s ) = mlog 2 f2- E «« log = l /+As l - 2- E »w ^ \W S \\ ) (76) 



and we use the notation % = and = {(/ijies, and A$ = As(l), W5 = W^s(l)- Consequently, (T75T > can 

be averaged over the channels: 

lim -I(X;V s \H)<F(S )qs ). (77) 
The dependence of i 7, from (|76| | on gj, stems from the definition of g,, as the bandwidth used for the noise 
compression, and is essential for the bound, as it is used for connecting the bandwidth for the signal compression 
to the achievable rate. Combining proposition ITTI with Corollary 0] above, we get the following proposition: 
Proposition 12: The achievable rate of a nomadic transmitter, over fast fading channel, is upper bounded by: 



R < max 

Qev,{o< qi <c 



} l scfT r} { FiSC ' <1S ' + J [Q ~ <1,] } } ' (?8) 

Remark 5: When C 2 : = C for i = 1, . . . , r, then the argument which is maximized over in ( fTBl , is 

symmetric in Since the argument is also concave in {<7;}[ =1 , for Cj = C, equation d78b is maximized by 

qi = q* for i = 1, . . . , r. So that for the symmetric case: 

i?< max min |F(5 C , if) + 151 [C - q*]\ . (79) 

QeV,0<q'<C SC{l,...,r} 1 ' 

Following remark |5J we give a special case where the upper bound in proposition [12] is tight. Notice that in this 
case, the optimal compression strategy used by the agents, is with fixed q* — qi. This means that the each agent is 
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required to know only its own H L , and not the other agents \Hj\j^ L . Furthermore, notice that this conclusion is 
due to the tight upper bound, and is not trivially obtained from the achievable rate d34b alone. 

Corollary 5: The CEO approach is optimal for infinite transmission power, Q = -j/, and = C, i = 1, . . . , r 
Here we take P — ► oo, and fixed t and r. 

Proof: We show it for r < t, where the proof for r > t follows the same lines. 

The achievable rate: Taking P — > oo and optimizing over qcEO (where qi — qcEO- i = 1, . . . , r in equation (l34l >) 
instead of over {qi}, results with: 

-I(X; V S \H) = mlog 2 (P) + E H log 2 |~ff s ffj| + mE ff log 2 (l - 2-*°»°) + o(P), (80) 
n t 

where o(P) — » when P — > oo. 

r/ze upper bound: On the other hand, taking P — * oo equation (|75] l becomes 

F(S,g s ) = mlog 2 ( 2 1 °s 2 (P)+iE H iog 2 |^ 5 H|j j 2 o(P) _ TJ 2 -#i 

mlog 2 (P) + E H log 2 \jH s H* s \ + mlog 2 ( 2°^ - J] 2"* ) , (81) 

Since d — C, equation (l78l is a concave symmetric function of {qi}, the solution is when all {qi} are identical, 
denoted as qi — q UB . So (|8TT ) becomes 

F(5,<^ B ) = mlog 2 (P) +E ff log 2 +mlog 2 (2°( p ) - 2^ B ) . (82) 

which is identical, in the limit, to (f80b . Substituting (f80b in ( [34-b and ( f82l in ( l79b gives the desired equality. ■ 
For P — > oo and Cj = C, there is no need to perform expectation over H of the rightmost element of dHOl . since 
taking gcfio = quB results with the optimal rate. This means that for large P and symmetric links, the compression 
parameters are independent of H, which in turn means that the i-th agent needs to know only its own Hi. Notice 
that the channel state information (CSI, Hi) is still required at i th agent, for the determination of the codebook of 
U (see [1]). This is unlike the classical Gaussian Wyner Ziv problem, which does not benefit from side information 
at the encoder. 

The upper bound of proposition fT2l is not tight because the upper bound in Lemma Q] was obtained using the vector 
version of the entropy power inequality. This inequality is known to be tight only for proportional correlation 
matrices, which is not our case. Thus the entropy power inequality introduces a gap that prevents the bound to be 
tight. This gap can be mitigated by taking into account smaller matrices. The following proposition improves upon 
proposition [T2l by optimizing also over sub-matrices of S: 

Proposition 13: An achievable rate of a nomadic transmitter, which uses circular-symmetric complex Gaussian 
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signalling with total power P, through agents with bandwidths {Ci} is upper bounded by: 



R u = max < 

Qev{o< qi <Ci} T i=1 



U$ =1 Zj -C {1,..., r}, U =1 



(83) 



j : Z j nZ i = (f) 
where F(Zj,qz.) is defined as before, in equation ( 1 751 ). 

The proof is very simple, considering for every group of disjoint subsets ({-2j}J_i : Zj (1 Zi = <fi when i ^ j) 
that cover Uj =1 Zj — S we can write: 



7(X ; y s |ff)<^7(X;^.|ff), 



(84) 



which is due to the Markov chain Vj — X — Vi when i ^ j, and then using the upper bound of proposition [121 again, 
for every element. Since the entropy power inequality, which is used in proposition [12] is not tight (in general) for 
the Gaussian vector case, but is tight for the Gaussian scalar case, this upper bound can improve upon the latter. 
For the symmetric case, where d = C for i = [l,...,r], due to the concavity of d53l , the maximum in d83l is 
achieved with qi = q*, i = [1, . . . , r], so that ( |83l is written as: 



max 

Q£V,0<q*<C 



(r-X;^i)(^-«*) 



% > 



(85) 



By solving the above optimization problem for {fcj}^ =1 and then solving for q* by explicitly writing F(j,q*) we 
can simplify (|85l l to 

Corollary 6: The achievable rate of nomadic transmitter in the symmetric case, Ci — C, i = 1,. .. ,r, is upper 
bounded by 



i? us ^ rC + r 



max jorin i^E Hj log 2 |7,- + 77^77*1 - log 2 (2° + 2l E »i I (86 ) 



where 77, is the fading coefficients seen by any subset of j agents (since the channel is ergodic, it does not matter 
which subset). 

The improvement of the bound from proposition Qj] over the bound from proposition Q~2] is seen in the next 
corollary, where the inequality ( |84l l is in fact an equality, and a conclusive result is obtained. 

Corollary 7: The CEO approach is optimal for Q = y7 and t — > 00 while r is fix. 
The bound (l83l is tight, when t — > 00 and Q is a multiplicity of the identity matrix. This is since HQH* is 
proportional to the identity matrix, each agent receives independent reception. This means r parallel links that can 
be optimized separately. Namely, when t — > 00 while r is fixed we get 



lim -HH* = I r 

t—>oc t 



(87) 
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Proof: 

The achievable rate: assigning the limit d87l > in d34b . we get: 



Hm R(H) = max I mm j £ [Q - q t ] + £ log a (l + P(l - 2"*)) 1 1 



(88) 



Notice that (1881 is independent of the channel realization H. 

The upper bound: On the other hand, taking Q = and jHiQH* = 1 for the calculation of F({i}, qi) in (l75l l 
gives log 2 (l + P{1 — 2~ 9i )). Assigning back to equation (l83l >. with Z,; = {i} results with: 



lim R u 

t — >oo 



max 

{0< 9i <Ci} 



mm ■ 



£ [Q - ft] + E l0 S2(l + ^(1 - 2^0)1} 



(89) 



which equals (T88b and proves the optimality. ■ 
2) Upper Bound for Block Fading Channels: In this subsection, we will consider the case of H distributed 
independently, but once per block, such that H = H. The resulting rate in equation (f34t is actually the average 
rate, supported by the scheme. In the sequel of this subsection, we will upper bound the rate- vs. -outage of the 
scheme. 

For the upper bound, we again use: 



R(H = h)< maxmin { -I(V S ;X\H = h) + V [C t - qA \ 



For I(Vs; X\H — h), we use the upper bound of equation (P73l l. Since H = H, we get: 



G(5, q s ) - mlog 2 (| 7,51 + A s | - - |W 5 | • 

-I(X;V s \H = h)<G(S,q s ) 
n 

where As = HgQH s , as before and Ws is defined by from equation ( |74l i. Combining d90l ) and 

noticing that H is a random variable, we get the following upper bound on the outage e vs. rate R: 

Proposition 14: An upper bound on the achievable rate R, for given outage probability e is the minimal R which 
fulfills: 



(90) 

(91) 
(92) 
and 



max 



nax J min J G(S, q s ) + V [d - q t ] \ \ < R ) 



< e. 



(93) 



Actually, we can improve upon j93l , the same it was done in proposition [T3| 
/ 



P 



max 

QeV,{0<qi<Ci} 



> < R 



< e, 



(94) 



UJ =1 ^C{l,...,r}, 

V [ i + j : Zj n 2i = ^ J 7 

but since the problem is not symmetric (due to the non-ergodic H), we can not further simplify it, as in Corollary 
[6] However, the limiting behavior of (f87T > is true also for the block fading case. Thus the optimality of the CEO 
approach when t — » oo from correlation [7] is assured for the block fading case as well. 
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P[dB] 

Fig. 3. The achievable rates compared to the upper bounds over a 2 X 2 system with C = 2: for fast fading Rayleigh channel with upper bound 
according to an arbitrary Q (Q singular) and to a fix Q = yit (Q identity), and for block fading Rayleigh channel, with outage probability of 
10 -2 , where the upper bound was calculated from {94). All as a function of P in dB, where the outage probability and the average over H 
were done by Monte Carlo simulations over H. 

C. Discussion 

When considering the upper bound, several clarifications are in order. It is known [15], [1] that when no fading 
is present, and the transmitter has only a single antenna, the upper bound is in fact tight. It means that when the 
sum Yli=j is sufficient statistics, the capacity is established. This situation changes when considering fading 
channels. It is evident from [27], that when Y\ — Y 2 is sufficient statistics, using our technique, which is based on 
the Berger-Tung CEO, is strictly sub-optimal and lattice approach can outperform the random binning. Therefore, 
it is not expected that ultimate performance is achieved, although the upper bound proximity to the achievable rate. 

VI. Numerical Example 

The achievable rates and the upper bounds for both fast fading and block fading channels, were calculated for 
a 2 x 2 system, with C\ = C-2 = 2, for several signal to noise ratios (P in dB), and the results are presented in 
figure [3] For the fast fading, both achievable rate and upper bound are obtained by averaging over 30 blocks, each 
containing 50 channel realizations (the expectation expressed by Eh in d86*l > and d34l>). It is seen there that the 
upper bound is convex, and that it is close to the achievable rate, when using CEO compression. For the lower and 
higher P the bound is tighter. 

For block fading channel, the upper bound from d94l i is depicted along with the achievable rate d48l) . for outage 
probability of e = 10~ 2 . The probability was calculated using Monte Carlo simulations over 10000 different 
realizations of H. It is seen there that the bound is again very tight for the low SNR region, and the gap becomes 
higher, with larger SNR, although it remains rather small, no more than 1 dB throughout the figure. 
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VII. Agents with Code Knowledge, and Fully Informed Transmitter 

In this section we consider the same model, as in the previous sections, with two differences. One difference is 
that we drop the nomadity assumption, and let the agents be able to decode messages. The second difference is 
that we assume full CSI (H) at the transmitter, in a non casual sense, so that the transmitter and the agents have 
the same channel state information. 

We get to the following proposition, which is proved in the Appendix. 

Proposition 15: In the ergodic regime, when the transmitter has full CSI, and the agents are cognizant of the 
codebook used, the rate ( 1951 ) is achievable 



R c 



= maxmm -!(£/*; Yi\X,W r ,H)} + I(U s c; X\W r , H) + ^ [I(W l ;Y l \H) ~ I(W l ;W f{ ^^\H)} I 



where 7r is a permutation of [1, . 



and 



f(n,i) = {tti, ...,i}, 



(95) 
(96) 



Pw-,x,Y r ,U''\H(w r ,x,y r ,u r \h) = P w -\H{w r \h)Px\w-M{A wT \ h )X\[ p Y z \x,H{yi\x,h)P u .\Y u w i ,H{^^^ 

i=l 

(97) 

The transmitter sends messages to the agents via the broadcast channel [6], by using the dirty paper coding (DPC) 
technique [28]. On top, the transmitter also sends information to be decoded only at the final destination, invoking 
the nomadic techniques of the previous scheme. We actually extended the results of [1], to include also DPC and a 
random ergodic channel. In [1] Corollary 4, the superposition coding combined with the CEO technique, was used 
for that setting, when no fading was present, and when the channel was degraded. The main difference between 
superposition coding and DPC is in that superposition coding lets the destined terminal cancel the interfering 
transmissions (which are destined to terminals with weaker channels) and the DPC performs precoding, so that 
interference transmissions are canceled at the transmitter (thus the name dirty paper coding). 

Next, for the fading Gaussian channel, the combined final destination decoding and DPC results with the rate 



Rdpc,i = max mmE H l ^[C, - qi 



log 2 



ies 



V, + diag(l - 2-^)H s c [q-^bAHZo 



lo s2 



i + ffi Q-E 



ies c 



l + H, 



(q-e 



jeT(,r,i) B J - Bi i 11 ' 



where the maximization is over 



q; 

Q, Bi 



C [rxt] ^R+, 
c [rxt] _^ c [txt] 



(98) 

(99) 
(100) 



such that Q, Bi y 0, Q — J^i Bi ^ and E#[trace(Q)] < P. The rate in d98l l can be increased by convex hull 
[28], since in general, this problem is non concave. 
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This rate is achieved by using W{, as in [28], and then Pui\Yi,Wi,H = Piii\Yi.H remains the same as in Proposition 



[T] The situation in the compression stage, is similar to when using Wyner-Ziv source compression with decoder 
side information over Gaussian sources, where supplying the side information (Wj) to the encoder does not improve 
the rate distortion. 

Although calculating d98l is hard, due to the non-convexity of the problem, note that a sub-optimal rate can be 
calculated for the symmetric case (C, = C), by using the DPC such that the maximal sum-rate is obtained, and so 
that Q = l t — t — Y^=i Bj tl 0, and letting E#[P'] < P. Since the problem is symmetric and the channel ergodic, 
each agent decodes the same rate. The DPC sum-rate can be obtained by the dual multi-access (concave) MIMO 
channel [29]. 



In this paper we showed the effectiveness of several compression techniques for decentralized reception in fast 
fading and block fading MIMO channels. We proved that in many cases, the elementary compression is sufficient 
to get the full-multiplexing gain. In addition, we showed the advantages of the CEO approach, which were evident 
in an asymptotic analysis and in a finite example. We presented upper-bounds for both fast fading channel and 
block fading channel, which are based on the nomadic characteristic of the scheme, along with the EPI, and which 
turned out to be quite tight even for relatively small 2x2 scheme. Achievable rate for a non-nomadic scheme was 
finally derived, combining the decentralized processing with the DPC. 



This research was supported by the EU 6th framework program via the NEWCOM network of excellence. 

Appendix I 
Useful definitions and Lemmas 

Let Pa 1: a 2 ,...,a l ( a i, a 2, ■ ■ ■ ; a L) be the probability function of the random variables A\, . . . ,A^ which take 
values in Ai, . . . , Al, respectively. 
Definitions: 

1) The marginal probabilities are then defined as 



VIII. Conclusion 



Acknowledgment 





(101) 



(£ is the set {!,..., L}). 



2) The conditional probabilities are defined as: 



P 



X\A s (ai\a s ) 



Pa,, a s ( a h as) 
Pa s (as) 



(102) 



for some SC£ and I ^ S and PA s ( a s) 7^ 0. 
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3) As commonly done (see [26], section 13, problem 10), define the e-typical (strongly conditional typical) set 
T e of ac as the set for which N(as,h\as,h) = for any as € As,h 6 H such that P As \ H (as\h) = 0, 
and also 

T e (h) ± {ac : VSCC, Va s eA s ,heH ^\N(a Sl h\a Sl h) - P AslH (a s \h)N(h\h)\ < 

(103) 

where N(as\as) denotes the counting operator of the number of occurrences of the symbol ag in the vector 
a s . 
Lemmas: 

Lemma 2: For any e > 0, there exist n* such that for all n > n* and randomly generated ac according to 

U.PAc\BMk)\h(k)) 

Pr{a £ e T e (»} > 1 -e. (104) 
Lemma 3: Fix some S C £ and probability 

Define the jointly e-typical set T e (/i), as before, by the joint probability ( 11051 ). 
Let a n c be generated according to 

fe=l I JGS J 

where the conditional and marginal probabilities are calculated from d 1 05b and wc is a given vector which was 
randomly generated and that belongs to the set T e (h), as defined by ( 11031 ) (that is, there exist ac that are jointly 
typical with wc)- 

Then the probability of the vector ac which is distributed according to ( 11061 ) to be in T t (h), which is defined 
according to (1105b is bounded by: 

Pr{(ai,...X,W C ) e T e (h)} > 2 -n[H(A s c\W c ,H)-H{Ac\Wc,H) + ^ s H{A l \W l ,H) + e,] (m) 

Pr{(oi,... lL ,11> jC ) e T e (h)} < 2 -n[H(A sC \W c M)-H{Az\WzM) + Y. les H(A l \W l ,H)-e 1 ] {m) 

where t\ — > as e — > 0. 

Lemma 4: Generalized Markov Lemma 
Let 

PA s ,w s ,Y s \H{as,ws,ys\h) = Pws,Ys\H(ws,ys\h)Yl p A t \Wt,Y t ,H{<h\wt,yt,h). (109) 

les 

Given randomly generated wsy s according to Pw s ,y s \h> f° r every i £ <S, randomly and independently generate 

N . > 2 n/(A i; y 1 |w 1 ,ff) vectors a, according to Ij£ =1 p A 4 |w<,ff(2i(Jfe)K(fc)> M fc ))> and index 1116111 b Y (1 < 
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t < Ni). Then there exist |<S| functions t* = 4>i(y i ,w i , aj , . . . , a\ Nt ^) taking values in [1 . . . N t ], such that for 
sufficiently large n, 

Px(({ap} ies ,ws,y s ) e T e (h)) > 1 e. (110) 

Proof: See [26] and [30] for the proofs of Lemmas [2][3] while Lemma [4] is a simple extension of Lemma 3.4 
(Generalized Markov Lemma) in [31]. ■ 
In the following, we use only e and remove the distinction between e and ei, for the sake of brevity. 

Appendix II 
Proof of ProposftionQ] 

A. Code construction: 
Fix S > 0. 

1) For the transmitter, for any codebook used, / 

• Randomly choose 2 nRcEO vectors x, with probability Px(x) = Ylk Px{x(k)). 

• Index these vectors by M C eo where M C eo S [l,2 nRBC ]. 

2) For the compressor at the agents 
For every channel realization h 

• Randomly generate 2 nCi vectors itj of length n 
according to J] fc Pu,\h (ui(k)\h(k)) . 

• Index all the generated it; with z, G [l,2 nC *]. 

B. Encoding: 

Let M be the message to be sent, and / is the codebook used. The transmitter then sends x(M, /) to the channel. 

C. Processing at the agents: 

The i th agent chooses any of the Zi such that 

(u^h).^^ eTf c '\h), (ill) 

where Tf c,l {h) is defined in the standard way, as ( 1103b . The event where no such z% is found is defined as the 
error event E\. 

After deciding on the agent forwards it to the final destination through the lossless link. 

D. Decoding (at the destination): 

The destination retrieves z r from the lossless links, and uses h and the random encoding /. 
The destination then finds M such that 

(x(M,f),u r (z r )) GTf c ' 3 (h). (112) 
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Where Tf C3 is defined in the standard way, as ( 11031 ). If there is no such M, or if if there is more than one, the 
destination chooses one arbitrarily. Define error E2 as the event where M ^ Mqeo- 

Correct decoding means that the destination decides M — M, An achievable rate R was defined as when the final 
destination receives the transmitted message with an error probability which is made arbitrarily small for sufficiently 
large block length n. 

E. Error analysis 

The error probability is upper bounded by: 

2 

Pr{error} = Pr (u? =1 .E*) < ^ Pr(£;)- (H3) 

i=l 

Where: 

1) Ei. No Ui(zi,h) is jointly typical with y s . 

2) E 2 : Decoding error x(M, f) ^ x(M, /), so that M ^ M. 

Next, we will upper bound the probabilities of the individual error events by arbitrarily small e. 

1) E\: According to Lemma [H the probability Pr{i?i} can be made as small as desired, for n sufficiently large, 
as long as 

d> I(Ui;Yi\H), (114) 

2) E 2 : Consider the case where M / M. There are 2 nRcEO such vectors, and the probability of (x(M, f),u r (z r )) 
to be jointly typical is upper bounded by (Lemma |3]l 2~ n \- I ( X ' U l^)~ c ]. Thus the rate Rceo is achievable if: 

Rceo < I(X;U r \H) - e, (115) 

which proves Proposition [TJ ■ 

Appendix III 

Proof of Optimality of Q = y/ t for the Ergodic Channel. 

First consider that since the channel is unknown to the transmitter, and VH is distributed as H when V is unitary 
(eigenvectors of a non diagonal Q) all through this work, Q can be limited to be diagonal. 
Next, for any given qi(H) and S, we have that 



E 



H 



log 2 det (l ls] + diag (l - 2~ q>{H) ) teS HsQH^j 



(116) 



is a concave function of Q, which is thus maximized by Q = [2]. Thus it also maximizes the maximum over 
all qi(H) and S concluding the proof. ■ 
Notice that this proof does not extend to ( TToT > and to Q5b . so that there, the optimal Q may not be proportional 
to identity, but is still diagonal, though. 
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Appendix IV 
Proof For Proposition [7] 

In this Appendix, we give a closed solution to the r = 2, symmetric case. We extend what was done in [1] to 
the ergodic channel case, with t > 1. Equation d34i > for the symmetric case can be written as: 



where 



Rceo = max < min {{& \[C - q*] + F s (q*)} \, (117) 

0<q*<C [5C{l,...,r} J 

F 5 ( 9 *) = max E H log 2 detf/| S | + ^diagfl-2-*W N ) H S H*) (118) 

such that 

V H [q t (H)]=q*, i = l,...,r. (119) 

Since the channel is ergodic, and the scheme symmetric, the users will be equivalent, and due to the concavity of 
the problem, the optimal solution is characterized by q* — E/f[rj(i?)]. That is, equal bandwidth that is wasted by 
all users on the noise quantization. By writing the equation this way, the ergodic nature of the channel is used, 
such that the channel randomness is limited to within F$. Since F$ is an increasing function of q*, when solving 
it, the solution of d34t is readily available numerically. So we are left with the concave problem of finding F$. 

Since Fii t __ r \(q*) is an increasing function of q*, and r(C — q*) is a decreasing function of q*, the point 
F{i,...,r}{q*) = r (C — q*) exists, and further, it is an upper bound to the achievable rate. Next, using Hadamard 
inequality we have that for any S 

log 2 det (l 2 + |diag(l - 2-vy i=1 HH*^j < log 2 det (l s + jdiag(l - 2-^) ieS H s H* s 

+ log 2 det (l s c + ydiag(l - 2-« i ) ieS cH s cH*J) . (120) 

Since the channel is ergodic, the minimum in (II 17b is over functionals of the channel probability, rather then channel 
realizations. In addition, the channel probability is symmetric with regards to the agents, leading to i 7 ^ ... ir \(q*), 
which is the minimum among all the subsets S. So that the achievable rate can be calculated by solving the following 
problem 



max 

{g < :C['-x*l-»R + }r =1 



E H log 2 det ^/ 2 + ydiag(l-2- <J '( H }) r ^HH*^j (121) 
such that qi(H) > and 

E H [ qi (H)} = q*, i = l,...,r. (122) 

Let us limit the discussion to the case of r = 2. The solution can be obtained through Lagrange multipliers, as 
follows (9 > 0) 

P 



Vlog 2 det (l 2 + ydiag (l - 2- q ^ H) y ^ ^ HH* ) - 0I 2 = [i{ H )- ( 12 3) 



So for any Hi(H) = 0, such that qi(H) > 0, we get that (i = 3 — i) 



2'^(A 2+l - 2~ q 'A 2 ) 
£=1,2: V 2+ ' A 2 -±=d, (124) 
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and Ejj(gj) = q*, where 



dot f I 2 + -diagf] 2 



HH* 



i=l,2 



Ai == det J 2 



P 



det 



-HH* 



A 3 == det (diag([0, 1]) + y-fftf* 

A 4 = det^diag([l,0]) + y^* 
We note that ( I124l i determines a one-to-one connection between and q* . In addition, note that 



(125) 
(126) 
(127) 
(128) 
(129) 



A = Ai + 2- qi - q2 A 2 - 2- qi A 3 - 2~ q2 A 4 , 



and that 



A 3 = A 2 + 
A 4 = A 2 + y|iJ 2 p 



The solution of (11241 ) is 

Qi = ~ log 2 



A 



i+2 



2A 2 (i + e) 



{1 + 29) - J (1 + 20) 2 -40(1 + : 



t AjA 2 

'a 3 a 4 



(130) 
(131) 

(132) 



We note that 



AiA 2 
A3A4 



< 1 with equality if and only if HH* is a diagonal matrix. So the square root in equation 
( 1132b is guaranteed to be positive real. By a simple derivative, it is easily verified that Fh(9), defined by (l38l ). is 
monotonically increasing with 9. 

Then, in case any of g,, i = 1, 2 from ( 11321 ) turns out negative (say Fh{9) > ^ which leads to qj < 0), then 
the solution is qj — and q t is equal to 



q t = - log 2 



l + f\Hi 



1 



(133) 



If d 1 33b is negative as well, the solution is = 0. As 6 gets smaller, more channels will result with d!32t solved 
with qi > 0, which means better compression. 
Overall, we can write 



Now 9 is determined by the equation 

P 



W ( 1+ T|gll 



E ff log 2 det [I 2 + jd 



iag ( 



l-2-«( ff - fl )] HH 



Fh(0) > if 



2(C-E i? fe(ff,0)]) 



and the achievable rate is 



RcEO = 2{C-E„[ qi (H,0)]). 



(134) 



(135) 



(136) 



This concludes the proof. 
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Appendix V 
Proof of LemmaQ] 

The proof is divided into two sections, we start by proving for the case where |<S| < t. This division is since 
the first case is easier to show, and thus gives better understanding of the guidelines and techniques, which are 
identical, albeit more involved, for the case of |«S| > t. 

For the sake of the proof, define: 

. Z = H S X, where I(Y S ; X\H) = I(Y S ; Z\H). 

. A z 4 E[ZZ*} = H s QHg = ^H S H* S (equal to A s ). 

• Z = AY, where A is the best estimator of Z from Y, calculated as A — A Z (I + A z ) _1 . 
Since |<S| < i we have that A z | > 0. Note that since Z is the best estimator 

Z = Z + N (137) 

where Z and N are independent, and since E{ZZ*\ = A* (J + A Z ) _1 A Z , we get E[AW*] = A Z (I + A z ) _1 . Now 
we can rely on the independence in (1137b and the vector entropy power inequality: 

|A 2 (fc)| 



fc=l 



Next we express the required quantity A = j;I{Z; V$\H = h) in both sides of ( 11381 1. For the left hand side, 

1 1 A 1 ( n \ A 

m KZ\ Vs , H = h) = —h(Z\H = /»)__ = _ l 0g2 (ft \A z{ k)\ j + log^e) - -. (139) 

The right hand side is more elaborated, and will be done in two stages. First note that: 

h(Z\V s ,H = h) = h(Z\Z, V s , H = h) + I(Z; Z\V S ,H = h). (140) 

We know that h(Z\Z, H = h) = h(Z\Z, V S ,H = h), from the definition of Z and Vs. This means that: 

-I(Z; Z\V S ,H = h) = -I(Z; Z\H = h) - X = - \og 2 I f[ \I + A z (k)\ ) - A. (141) 
n n n \ ) 

Second, we have that Z = AYg, so 

h(Z\Z,Vs,H = h) = h(Y s \Z,V s ,H = h) + 2log 2 (f[\A(k)\J = 

J2h(Y i \Z,V i ,H = h) + 2lo g2 (f[ ^%r) ■ (142) 



A 1 



define qi(h) = iJ(Y"j; Vi|X, H = h) and since we used additive noise with unit variance, 



rewrite (11421) as 



-h(Yi\Z, Vi, H = h) = log 2 (^e) - qi (h). (143) 
n 



h(Z\Z, V s ,H = h) = \S\ log 2 (7re) - ]T qi (h) + \ log 2 \J[ ^fj^ j ■ (144) 
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Now using ( 11411 ) and ( I144l i in the right hand side, written in ( 11401 ). we get to: 

2 1 

^M^^, =K rj 2 -^ (ft ^ti^k)^ 1 (ni^ + A,(fc)iy SI 2-A. (145) 
Finally we combine left hand side (11391 ) and right hand side ( 1145b , and get 

„ C 2-*> (niA.wi)* > „ 2 -*n^ (ii^^)* (ni^A.wi)* + «n 

V " "" S k 1 (146) 

Reordering the equation we get to (1731 . which proves Lemma [T] for when |iS| < i. 



We continue to the case where \S\ > i, where we have more agents than transmitters, so that \A Z \ = 0. Like in 
the previous setting we define X = AY to be the best estimator of X out of Y. So that now A = QH*(I + A 2 ) _1 , 
and we have 

X = X + N, (147) 

where X and N are independent and using the matrix inversion Lemma E[7V7V*] = + H*!!)^ 1 = Q(I + 

QH*!!)^ 1 . Again we use the entropy power inequality: 



2 £ t h(X\V Bt H=h) > 2 ±h(X\VS,H=h) +7re |Q|i TJ f * 



Using the same argument as the one used for J 1 39b . the left hand side of ( 1148b becomes 



(148) 



2 &fc(*|V*,H=h) =7re |Q|* 2 -$_ (149) 

The left expression in the right hand side of ( 1148b can be written as the sum of two arguments, as in ( 1140b . where 

the right-most mutual information (like ( 1141b ) is 

1 1 1 f n \ 

-I(X;X\V s ,H = h) = -I(X;X\H = h)-X=-\og 2 TT|/ + A z (fc)| -A. (150) 

n n n \ J -- L ; 

The difference between the case where |<S| < t and |<S| > Hs evident in the derivation of (11421) . which for |<S| > t 
requires the double use of the entropy power inequality. So we want to lower bound h(X\Vs, X, H = h). First, 
let us decompose A using the singular value decomposition into A = U1DU2, where U\ G C tx * and U2 <E C' 5 ^' 5 ' 
are two unitary matrices and D £ R**! 5 is diagonal matrix. So we have that: 

h(X\V s ,X, H = h) = h(U 1 DU 2 Y\V s ,X, H = h) 

n t n 

= log 2 ]l\Um\ 2 + ^og 2 U\ D ^^\ 2 + h ^ U2 ^ Y \ V ^ X ^ H = h ^ ( 151 > 

3=1 

since U2 is unitary matrix. Next we employ the entropy power inequality to lower bound h((U2)jY\Vs, X, H = h): 

n 

2 h((u 2 ) jY \v s ,x,H=h) >^2 h ^l^' x ' Jf = ?l )[]|(C/ 2 (A ; )) J - i | 2 . (152) 

ies 
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This inequality is achieved with equality for Gaussian variables. A lower bound on ( 1151b is given by 

h(X\V s ,X,H = h) >log 2 ^|C/ 1 (A : )^(fc)C/ 2 (A ; )diag(2^( h )) ie5 Lr 2 (A : )^(A : )*C/ 1 (A : )*^ + nt\og 2 (ne) 

= log 2 (j[ I + A 2 (fc))- 1 diag(2-^( h )) ie5 (/ + A^k^H^Q^ + nt log 2 (^e) 

= l0 ^[n{ \i+A z ( k )\ 2 ) + g2{ ] ( } 



since 



\nrr*(TM\ \— i 7" I a Hrrni 1(7 + QH*H)QH*{I + A,)' 1 ^/ + A z )- 1 gQ(I + g*ffQ)| IQiTPffQI 
|Qff(/+A,) D(/+A,) HQ| = ^— gj = ____ 

(154) 

To conclude, we use < fl49l ( fBOl and Q33): 

lnl i„ a n -x (rx lT . , 1 , l \Q\ 2 \H(k)*diag(2-^^) ieS H(k)\\~ A / 1 

TO |Q|t2-t >,re2-- ni^ + Mfc)l M V + W +HQI^ II [WTATk 



(155) 

which by taking expectation with respect to H , together with (11461 ) proves Lemma Q] ■ 

Appendix VI 
Proof of Proposition [HI 
The proof of Proposition [131 is based on the proof of Theorem 3 from [1]. 

A. Code construction: 

For every channel realization h, determine the maximizing tt. Fix 6 > and then 
1) For the broadcast transmissions, for every i = tti, . . . , ir r : 

. Randomly generate 2 n[I{m '' w ^^^ lH}+5] vectors w if accordingto P Wi \H(wi\h) = Uk=i p w t \H(wi(k)\h(k)). 
• For every Wj-^ * generated in the previous iteration, find at least one Wi within the generated set which 
is jointly typical. Joint typicality means that 

K'^tW)) 6 Tf ^(h), (156) 

where 

TfC'^h) 4 J t , : \/S C {t,f (tt,*)}, Vu, 5 eW s i£H 



ilJV^s.fcltes.hJ-JVaiH^slW^)! <7^-ri- (157) 



< In case no such vector exists, declare error event E\, 

. Repeat the last steps for ^i^^-i^W^H)-^ ^ 
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Label the resulting vectors of each repetition, which were jointly typical, by Mi, 

where M l G [i^^W^I^-W^fiM)!^-^]. Then M r = {M lt ...,M r } and further define M Mi as 
the set labeled by Mj. So that w r (M r ,h) are the r vectors which were selected in the last stage and are 
jointly typical. 

2) For the message which is decoded at the final destination, for every w r defined by some M r , and for every 
random encoding realization / 

• Randomly choose 2 nRcEO vectors x, with probability P x \w.h( x I w r i h) = \\ k Px\w r ,H {x(k)\w r (k), h(k)). 
. Index these vectors by M CEO where M CEO G [l,2 nRcBO ]. 

. So we have 2 n[ ^= l I ( w *; r *l JJ )- J ( w *; w f om) \h)-S] different ma ppi ngs between indices M CEO and vectors 
x, where the one used is determined by M r . We will therefore denote x(Mceo, M r ,h) as the vector 
indexed by Mceo, M r . We leave out the notation of / in the sequel, for the sake of brevity, since for 
decoding agents, the chosen / is known at the agents, so the achievable rate is valid for every realization 
of /, with high probability. 

3) For the compressor at the agents 
For all w r indicated by M r , 



. Randomly generate 2 ^-( c '-{ J Wi y 'l«)- J W^.i)l«)-*})] vectors u t of length n 

according to Y[ k P Ui][WuH {ui{k)\wi{k),h(k)). 
. Repeat the last step for S; = 1, . . . , 2 n( - c ^-^^ w ^ H '>- I( - w ^ w ^,^ H '>-^\ define the resulting set of u t 



of each repetition by S Si . 

< Index all the generated u, with Zi G [l,2 nRi ]. We will interchangeably use the notation S Si for the set 

of vectors Ui as well as for the set of the corresponding Z{. 
• Notice that the mapping between the indices z% and the vectors Ui depends on Wi , h. So we will write 

Ui(zi,Wi,h) to denote Ui which is indexed by Zi for some specific w i} h. 



Let M — (M r , Mqeo) be the message to be sent (M r is defined at the previous subsection), and the channel 
realizations be h. The transmitter then sends x(Mceo, M r , h) to the channel. 

C. Processing at the agents: 

1) Decoding: The i th agent knows h and receives y i from the channel. It looks for w,i so that 



B. Encoding: 



{ yi ,Wi)eTf{h), 



(158) 



where 




T^(h)= < 



Wi,Vi : 



VweWuheH: ±\N{w,h\wi,h) - P Wi]H (w\h)N(h\h)\ < ^ 
Vyey^heH: ±\N(y,h\ yi ,h) - P YilH (y\h)N(h\h)\ < ^ 
w G Wj, y G y i ,heH: ± \N(w,y,h\wi, yi ,h) - P w ., Y . ]H (w,y\h)N(h\h)\ < 



February 2, 2008 



DRAFT 



32 



If no such u>i exists, chose arbitrary Wi, and if more than one is found, select one of them arbitrarily. Denote 
by E2 the error event where the chosen vector Wi ^ w.i(M r , h). 
2) Compression: The i th agent chooses any of the such that 

{uiizuwulijiy^wi) eTf(li), (160) 

The event where no such Zi is found is defined as the error event E3. 

After deciding on Zi the agent transmits Sj, which fulfills G S Si , and Mi to the final destination through the 
lossless link, where Af, corresponds to w t . 

D. Decoding (at the destination): 

The destination retrieves M r and s r = (si, . . . ,s r ) from the lossless links. 
The destination then finds the set of indices z r = . . . ,z r } of the compressed vectors u r and Mqeo which 
satisfy 

(x(M C EO,M r , h, f),u r {z r , M r ,h), w r {M r , h)) € T 3 t (h) 

(161) 

z r e S S1 x ■ ■ ■ x S Sr . 

V 

Where is defined in the standard way, as ( 1103b . If there is no such z r , Mceo, or if there is more than one, the 
destination chooses one arbitrarily. Define error £4 as the event where Mceo Mqeo- 

Correct decoding means that the destination decides M — M. An achievable rate R was defined as when the final 
destination receives the transmitted message with an error probability which is made arbitrarily small for sufficiently 
large block length n. 

E. Error analysis 

The error probability is upper bounded by: 

4 

Pr{error} = Pr (uf =1 Ei) < p *( E i)- (162) 

i=l 

Where: 

1) E\\ No r-tuple w r jointly typical is found. 

2) E2: A different Wi ^ Wi is selected by the i th agent. 

3) £"3: No Ui(zi,Wi, h) is jointly typical with (y i: vbi). 

4) E A : Decoding error x(M C eo, Mt, /) + x{M, /). 

Next, we will upper bound the probabilities of the individual error events by arbitrarily small e. 

1) E\: From Lemma|4] it is evident that Pr(£'i) can be made as small as desired, when n is increased, as long 
as 6 > 0. 

2) E2: By Lemma 12 the probability of jointly distributed variables not to be e-typical is as small as desired for 
n sufficiently large. According to Lemma [3] the probability that another u>i belongs to T*' 1 is upper bounded by 

2 -n[J(W«;y«|fl)-e] i Since there are n0 more than 2 n[I{W i ;Y i \H)-I{W i ;W n ^ i) \H)-S] such ^ ^ probab0ity of ^ 

can be made arbitrarily small as n goes to infinity as long as i"(W,; Wf< AH) + 5 > e. 
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3) E%: According to Lemma[4] the probability Pr{i?3} can be made as small as desired, for n sufficiently large, 
as long as 

Ri >I{U i ;Y i \W i ,H). (163) 

4) E\: Consider the case where Mceo ^ Mceo and z$ ^ Zs- There are 

2 ™lRcEO+j: ies [&-(C i -{I(W i ;Y i \H)-I(W i ;W nWti) \H)-6})]] 

such vectors, and the probability of (x(M) , us(zg) , Ugc (zs c )) to be jointly typical is upper bounded by (Lemma 

m 2n [H { X,U^W^H)-H{X\W^H)-H{U sC \W^H)-^ s H{UAW u H) + e]_ ^ ±e rate . g achievable if . 

Rceo < 5^[Ci-{/(Wi; Yi\H)-I(Wa W t(vA \H)-8}-Ri+H(Ui\Wi,H)]-H(Us\X, W r , H)-H(U s c \X, U s ,W r , H) 
< J2[Ci - {I(W i; Yi\H) - J(Wi; W t{n<i) \H) -5}- I(Y i; U t \X, W u H)] + I(U s c;X\W r , H), (164) 

where the second inequality is due to ( 1163b and because of the Markov chain Ui — (W r , X, H) — f7i,...,<-i <+i r - 
Finally, the overall achievable rate is equal to 

r 

Rceo + ]T{/(^; Y t \S) - I{W t ; W f ^ t) \H) - 6}, (165) 
which proves Proposition [T5] ■ 
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